Towards Emotive Annotation in plWordNet 4.0
Abstract
The paper presents an approach to building a very large emotive lexicon for Polish based on plWordNet. An expanded annotation model is discussed, in which lexical units (word senses) are annotated with basic emotions, fundamental human values and sentiment polarity. The annotation is performed manually in the 2+1 scheme by pairs of linguists and psychologists. Guidelines referring to usage in corpora, substitution tests, and linguistic properties of lexical units (e.g. derivational associations) are discussed. The application of the model to a substantial extension of the emotive annotation of plWordNet is presented. The high inter-annotator agreement achieved shows that a promising emotive resource can be created with a relatively small workload.
Related resources
Context-sensitive Sentiment Propagation in WordNet
Current state of emotive annotation of plWordNet:
• more than 83k annotations covering more than 54k lexical units and 41k synsets
• 22k polarity annotations different than neutral (13k of lexical units and 9k synsets)
• 1.5k synsets with different polarity across their units
  – without neutral units, only 345 of synsets with varying polarity strength
  – without neutral and ambiguous annotations, on...
Towards Mapping Thesauri onto plWordNet
plWordNet, the wordnet of Polish, has become a very comprehensive description of the Polish lexical system. This paper presents a plan of its semi-automated integration with thesauri, terminological databases and ontologies, as a further necessary step in its development. This will improve linking of plWordNet into Linked Open Data, and facilitate applications in, e.g., WSD, keyword extraction ...
plWordNet as the Cornerstone of a Toolkit of Lexico-semantic Resources
A wordnet is many things to many people: a graph of inter-related lexicalised concepts, a taxonomy, a thesaurus, and so on. A wordnet makes good sense as the mainstay of any deep automated semantic analysis of text. We have begun the construction of a multi-component, multi-use toolkit of natural language processing tools with plWordNet, a very large Polish wordnet, at its centre. The component...
A Large Wordnet-based Sentiment Lexicon for Polish
The applications of plWordNet, a very large wordnet for Polish, do not yet include work on sentiment and emotions. We present a pilot project to annotate plWordNet manually with sentiment polarity values and basic emotion values. We work with lexical units (LUs), plWordNet’s basic building blocks. So far, we have annotated about 30,000 nominal and adjectival LUs. The resulting lexicon is already one...
Implementation of the Verb Model in plWordNet 4.0
The paper presents an expansion of the verb model for plWordNet, the wordnet of Polish. A modified system of constitutive features (register, aspect and verb classes), synset relations and lexical relations is presented. Special attention is given to the proposed new relations and to changes in the verb classification. We also discuss the results of its verification by application to the description of a...